SlideShare a Scribd company logo
1 of 82
Download to read offline
The Future of the Journal



Anita de Waard, a.dewaard@elsevier.com
Disruptive Technologies Director, Elsevier Labs


VU, June 21, 2010
Science is made of information...
Science is made of information...




   ...that gets created...
Science is made of information...




   ...that gets created...   ... and destroyed.
What is the problem?
What is the problem?



1. Researchers can’t keep track of their data.
What is the problem?



1. Researchers can’t keep track of their data.


2. Data is not stored in a way that is easy for authors.
What is the problem?



1. Researchers can’t keep track of their data.


2. Data is not stored in a way that is easy for authors.


3. For readers, article text is not linked to the underlying data.
The Vision   Work done with Ed Hovy, Phil Bourne,
             Gully Burns and Cartic Ramakrishnan
The Vision                                                        Work done with Ed Hovy, Phil Bourne,
                                                                  Gully Burns and Cartic Ramakrishnan

                                                 1. Research: Each item in the system has metadata
                        metadata                 (including provenance) and relations to other data items
                                   metadata      added to it.

       metadata




             metadata

                                      metadata
The Vision                                                        Work done with Ed Hovy, Phil Bourne,
                                                                  Gully Burns and Cartic Ramakrishnan

                                                 1. Research: Each item in the system has metadata
                        metadata                 (including provenance) and relations to other data items
                                   metadata      added to it.
                                                 2. Workflow: All data items created in the lab are added
       metadata
                                                 to a (lab-owned) workflow system.




             metadata

                                      metadata
The Vision                                                                         Work done with Ed Hovy, Phil Bourne,
                                                                                   Gully Burns and Cartic Ramakrishnan

                                                                  1. Research: Each item in the system has metadata
                                         metadata                 (including provenance) and relations to other data items
                                                    metadata      added to it.
                                                                  2. Workflow: All data items created in the lab are added
             metadata
                                                                  to a (lab-owned) workflow system.
                                                                  3. Authoring: A paper is written in an authoring tool which
                                                                  can pull data with provenance from the workflow tool in the
                                                                  appropriate representation into the document.

                    metadata

                                                       metadata




     Rats were subjected to two
     grueling tests
     (click on fig 2 to see underlying
     data). These results suggest that
     the neurological pain pro-
The Vision                                                                           Work done with Ed Hovy, Phil Bourne,
                                                                                     Gully Burns and Cartic Ramakrishnan

                                                                    1. Research: Each item in the system has metadata
                                           metadata                 (including provenance) and relations to other data items
                                                      metadata      added to it.
                                                                    2. Workflow: All data items created in the lab are added
               metadata
                                                                    to a (lab-owned) workflow system.
                                                                    3. Authoring: A paper is written in an authoring tool which
                                                                    can pull data with provenance from the workflow tool in the
                                                                    appropriate representation into the document.

                      metadata                                      4. Editing and review: Once the co-authors agree, the
                                                                    paper is ‘exposed’ to the editors, who in turn expose it to
                                                         metadata   reviewers. Reports are stored in the authoring/editing
                                                                    system, the paper gets updated, until it is validated.




       Rats were subjected to two
       grueling tests
       (click on fig 2 to see underlying
       data). These results suggest that
       the neurological pain pro-



    Review
                                   Revise
                    Edit
The Vision                                                                           Work done with Ed Hovy, Phil Bourne,
                                                                                     Gully Burns and Cartic Ramakrishnan

                                                                    1. Research: Each item in the system has metadata
                                           metadata                 (including provenance) and relations to other data items
                                                      metadata      added to it.
                                                                    2. Workflow: All data items created in the lab are added
               metadata
                                                                    to a (lab-owned) workflow system.
                                                                    3. Authoring: A paper is written in an authoring tool which
                                                                    can pull data with provenance from the workflow tool in the
                                                                    appropriate representation into the document.

                      metadata                                      4. Editing and review: Once the co-authors agree, the
                                                                    paper is ‘exposed’ to the editors, who in turn expose it to
                                                         metadata   reviewers. Reports are stored in the authoring/editing
                                                                    system, the paper gets updated, until it is validated.
                                                                    5. Publishing and distribution: When a paper is
                                                                    published, a collection of validated information is
                                                                    exposed to the world. It remains connected to its related
       Rats were subjected to two
                                                                    data item, and its heritage can be traced.
       grueling tests
       (click on fig 2 to see underlying
       data). These results suggest that
       the neurological pain pro-



    Review
                                   Revise
                    Edit
The Vision                                                                           Work done with Ed Hovy, Phil Bourne,
                                                                                     Gully Burns and Cartic Ramakrishnan

                                                                    1. Research: Each item in the system has metadata
                                           metadata                 (including provenance) and relations to other data items
                                                      metadata      added to it.
                                                                    2. Workflow: All data items created in the lab are added
               metadata
                                                                    to a (lab-owned) workflow system.
                                                                    3. Authoring: A paper is written in an authoring tool which
                                                                    can pull data with provenance from the workflow tool in the
                                                                    appropriate representation into the document.

                      metadata                                      4. Editing and review: Once the co-authors agree, the
                                                                    paper is ‘exposed’ to the editors, who in turn expose it to
                                                         metadata   reviewers. Reports are stored in the authoring/editing
                                                                    system, the paper gets updated, until it is validated.
                                                                    5. Publishing and distribution: When a paper is
                                                                    published, a collection of validated information is
                                                                    exposed to the world. It remains connected to its related
       Rats were subjected to two
                                                                    data item, and its heritage can be traced.
       grueling tests
       (click on fig 2 to see underlying
                                                                    6. User applications: distributed applications run on this
       data). These results suggest that                            ‘exposed data’ universe.
       the neurological pain pro-


                                                                                   Some other publisher
    Review
                                   Revise
                    Edit
What is needed to get there?
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
 and provenance-tracked elements
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
 and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
 information on any knowledge item created in a lab,
 including provenance/privacy/IPR rights
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
 and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
 information on any knowledge item created in a lab,
 including provenance/privacy/IPR rights
D. Semantic/Linked Data XML repositories.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
 and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
 information on any knowledge item created in a lab,
 including provenance/privacy/IPR rights
D. Semantic/Linked Data XML repositories.
E. Publishing systems as application servers
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights
D. Semantic/Linked Data XML repositories.
E. Publishing systems as application servers
F. Social change: Scientists store, track and annotate their
  work.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly    tool builders
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights
D. Semantic/Linked Data XML repositories.
E. Publishing systems as application servers
F. Social change: Scientists store, track and annotate their
  work.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly    tool builders
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements                tool builders
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights
D. Semantic/Linked Data XML repositories.
E. Publishing systems as application servers
F. Social change: Scientists store, track and annotate their
  work.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly    tool builders
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements                tool builders
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights standards bodies
D. Semantic/Linked Data XML repositories.
E. Publishing systems as application servers
F. Social change: Scientists store, track and annotate their
  work.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly    tool builders
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements                tool builders
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights standards bodies
D. Semantic/Linked Data XML repositories.         publishers
E. Publishing systems as application servers
F. Social change: Scientists store, track and annotate their
  work.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly    tool builders
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements                tool builders
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights standards bodies
D. Semantic/Linked Data XML repositories.         publishers
E. Publishing systems as application servers      publishers
F. Social change: Scientists store, track and annotate their
  work.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly      tool builders
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements                  tool builders
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights standards bodies
D. Semantic/Linked Data XML repositories.            publishers
E. Publishing systems as application servers         publishers
F. Social change: Scientists store, track and annotate their
  work.                  institutes, funding bodies, individuals
A. Workflow tools are emerging
A. Workflow tools are emerging



                                http://MyExperiment.org
A. Workflow tools are emerging
         http://VisTrails.org




                                http://MyExperiment.org
A. Workflow tools are emerging
                http://VisTrails.org




                                       http://MyExperiment.org




    http://wings.isi.edu/
B. Authoring is a part of doing science

     The Knowledge Ecosystem:
         Interlocking Cycles of Research
               Draw conclusions                  Draw conclusions

                                  Communicate

                                                                      Collect data
Collect data



Perform                                                               Perform
experiment                         Gather info                        experiment




                                   Synthesize
                Create/modify                         Create/modify
                hypothesis                            hypothesis

                                                                        Slide by Tim Clark
B. Authoring ‘ecosystems’: e.g., SWAN
                 SWAN Semantic Relationships




                Excel file            describes


      Private     makes            hasEvidence                  annotates
                                                                                       comment

                                                  publication        person
                             Claim
                                  hasEvidence              authoredBy       authorOf


                                                  publication
                                                            shareWith
                                      describes

                                       MSWORD file                               Slide by Tim Clark
B. Authoring ‘ecosystems’: e.g., SWAN
            person           SWAN Semantic Relationships
                                                                              annotates
                                                                                                        comment
    authoredBy
                                 makes             hasEvidence
                                                                                                                concept
                                                                                 annotates
                                           Claim                  publication
    shareWith       hypothesis
                                 makes             hasEvidence
                                                                                                                  gene
                                           Claim                  publication
                                                   hasEvidence                    discussedIn
            group

                                                                  publication

          Public            Excel file              describes                 describes
                                                                                                         PDFs

          Private                makes         hasEvidence                      annotates
                                                                                                          comment

                                                                publication          person
                                         Claim
                                              hasEvidence                authoredBy          authorOf


                                                                publication
                                                                          shareWith
                                                    describes

                                                     MSWORD file                                  Slide by Tim Clark
C. Metadata: HCLS SiG Scientific Discourse
C. Metadata: HCLS SiG Scientific Discourse
http://esw.w3.org/HCLSIG/SWANSIOC:
C. Metadata: HCLS SiG Scientific Discourse
http://esw.w3.org/HCLSIG/SWANSIOC:
Project Description
C. Metadata: HCLS SiG Scientific Discourse
http://esw.w3.org/HCLSIG/SWANSIOC:
Project Description
Provide a Semantic Web platform for biomedical discourse which
can be evolved over time into a more general facility for many types of
scientific discourse, and which is linked to key biological categories
specified by ontologies.
C. Metadata: HCLS SiG Scientific Discourse
http://esw.w3.org/HCLSIG/SWANSIOC:
Project Description
Provide a Semantic Web platform for biomedical discourse which
can be evolved over time into a more general facility for many types of
scientific discourse, and which is linked to key biological categories
specified by ontologies.
Discourse categories should include research questions, scientific
assertions or claims, hypotheses, comments and discussion, experiments,
data, publications, citations, and evidence.
C. Metadata: HCLS SiG Scientific Discourse
http://esw.w3.org/HCLSIG/SWANSIOC:
Project Description
Provide a Semantic Web platform for biomedical discourse which
can be evolved over time into a more general facility for many types of
scientific discourse, and which is linked to key biological categories
specified by ontologies.
Discourse categories should include research questions, scientific
assertions or claims, hypotheses, comments and discussion, experiments,
data, publications, citations, and evidence.
Our primary scientific use cases will be derived from problems in
digital scientific communications and web-based research
collaboratories supporting research in neurological disorders and
therapies.
C. Metadata: Annotation Ontology
    foaf:person             rdf:Type

                                           http://www.ht.org/
                                               foaf.rdf#me

       June 1, 2010
                                                pav:createdBy

                   pav:createdOn                                      ann:annotates                   http://anyurl.com/sf_pat01.html


                     hasTag

                                                rdf:Type
                                   hasTopic
       Tag
                                                       Atomic

           tag
                                   FMA:skull                ann:context
                                                                               onDocument

Linear skull fracture



                                                                 rdf:Type
Other annotations on the same document:
1. Atomic annotation on image (tag: “hematoma”)
2. General annotation (tag: “injury”)                                     InitEndCornerSelector
                                                                                                             init
Other annotations on similar documents:                                                                                      (304, 507)
1. General annotation (tag: “skull fracture”)                                  rdfs:SubClassOf
                                                                                                             end
                                                                                                                             (380, 618)
                                                                                      ImageSelector
                                                                                                                       Slide by Tim Clark
C. Metadata: Rhetorical Document Task




                                 Image by Tudor Groza
C. Metadata: Rhetorical Document Task




                                 Image by Tudor Groza
C. Metadata: Rhetorical Document Task




                                           Image by Tudor Groza
Call today:
discuss modeling coarse-grained rhetorical structure as
PAM (PRISM Aggregator Message) - a standard format
for transferring XML from publishers to aggregators
(used by Nature.com, and Elsevier in the future)
D. Linked Data: E.g. for Elsevier
D. Linked Data: E.g. for Elsevier




 <ce:section id=#123>
D. Linked Data: E.g. for Elsevier




                         this says
 <ce:section id=#123>                mice like cheese
D. Linked Data: E.g. for Elsevier




                                       said @anita
                                     on May 31 2010




                         this says
 <ce:section id=#123>                mice like cheese
D. Linked Data: E.g. for Elsevier

                                          but we all know
                                       she was jetlagged then


                                       said @anita
                                     on May 31 2010




                         this says
 <ce:section id=#123>                mice like cheese
D. Linked Data: E.g. for Elsevier
             immutable, $$, proprietary
                                                       but we all know
                                                    she was jetlagged then


                                                    said @anita
                                                  on May 31 2010




                                      this says
 <ce:section id=#123>                             mice like cheese
D. Linked Data: E.g. for Elsevier
             immutable, $$, proprietary     dynamic, personal, task-driven, - open?
                                                           but we all know
                                                        she was jetlagged then


                                                        said @anita
                                                      on May 31 2010




                                      this says
 <ce:section id=#123>                                mice like cheese
D. What to link? Semantic annotation grid
D. What to link? Semantic annotation grid
D. What to link? Semantic annotation grid
Granularity
  collection

  document
      claim

      triple

     entity
D. What to link? Semantic annotation grid
Granularity
  collection

  document
      claim

      triple

     entity                                                  Moment
               measure author/editor typesetter/production reader/data minin
D. What to link? Semantic annotation grid
Granularity
  collection

  document
      claim

      triple

     entity                                                  Moment
               measure author/editor typesetter/production reader/data minin
Meansmanual

   semi-automated

automated
D. What to link? Semantic annotation grid
Granularity
  collection

  document
      claim
                     Automated Copy Editing
      triple

     entity                                                  Moment
               measure author/editor typesetter/production reader/data minin
Meansmanual

   semi-automated

automated
D. What to link? Semantic annotation grid
Granularity
  collection

  document
      claim
                     Automated Copy Editing
      triple

     entity                                                  Moment
               measure author/editor typesetter/production reader/data minin
                                                             Reflect
Meansmanual

   semi-automated

automated
D. A start: .XMP RDF in Elsevier’s PDFs (DC + PRISM)
E. Publishing on an Application server
E. SD as application server: an example
F. Social Change. Some next Steps:
F. Social Change. Some next Steps:
• Fall 2010: ‘Beyond the PDF’:
F. Social Change. Some next Steps:
• Fall 2010: ‘Beyond the PDF’:
 –Workshop organized by Phil Bourne @UCSD:
F. Social Change. Some next Steps:
• Fall 2010: ‘Beyond the PDF’:
 –Workshop organized by Phil Bourne @UCSD:
  • Take one paper from his group
F. Social Change. Some next Steps:
• Fall 2010: ‘Beyond the PDF’:
 –Workshop organized by Phil Bourne @UCSD:
  • Take one paper from his group
  • And all data that went into making that paper
F. Social Change. Some next Steps:
• Fall 2010: ‘Beyond the PDF’:
 –Workshop organized by Phil Bourne @UCSD:
  • Take one paper from his group
  • And all data that went into making that paper
  • Including all correspondence, raw data, etc.
F. Social Change. Some next Steps:
• Fall 2010: ‘Beyond the PDF’:
 –Workshop organized by Phil Bourne @UCSD:
   • Take one paper from his group
   • And all data that went into making that paper
   • Including all correspondence, raw data, etc.
 –Challenge: how better to represent that?
F. Social Change. Some next Steps:
• Fall 2010: ‘Beyond the PDF’:
  –Workshop organized by Phil Bourne @UCSD:
     • Take one paper from his group
     • And all data that went into making that paper
     • Including all correspondence, raw data, etc.
  –Challenge: how better to represent that?
• 2010 - 2011: Try to gather resources, current efforts, etc. on
  virtual platform
F. Social Change. Some next Steps:
• Fall 2010: ‘Beyond the PDF’:
  –Workshop organized by Phil Bourne @UCSD:
     • Take one paper from his group
     • And all data that went into making that paper
     • Including all correspondence, raw data, etc.
  –Challenge: how better to represent that?
• 2010 - 2011: Try to gather resources, current efforts, etc. on
  virtual platform
• August 2011: ‘FoRC: Future of Research Communication’
F. Social Change. Some next Steps:
• Fall 2010: ‘Beyond the PDF’:
  –Workshop organized by Phil Bourne @UCSD:
     • Take one paper from his group
     • And all data that went into making that paper
     • Including all correspondence, raw data, etc.
  –Challenge: how better to represent that?
• 2010 - 2011: Try to gather resources, current efforts, etc. on
  virtual platform
• August 2011: ‘FoRC: Future of Research Communication’
  –Dagstuhl Workshop
F. Social Change. Some next Steps:
• Fall 2010: ‘Beyond the PDF’:
  –Workshop organized by Phil Bourne @UCSD:
     • Take one paper from his group
     • And all data that went into making that paper
     • Including all correspondence, raw data, etc.
  –Challenge: how better to represent that?
• 2010 - 2011: Try to gather resources, current efforts, etc. on
  virtual platform
• August 2011: ‘FoRC: Future of Research Communication’
  –Dagstuhl Workshop
  –Involve key people (include funding bodies, libraries,
    institutions) to see where bottlenecks are
F. Social Change. Some next Steps:
• Fall 2010: ‘Beyond the PDF’:
  –Workshop organized by Phil Bourne @UCSD:
     • Take one paper from his group
     • And all data that went into making that paper
     • Including all correspondence, raw data, etc.
  –Challenge: how better to represent that?
• 2010 - 2011: Try to gather resources, current efforts, etc. on
  virtual platform
• August 2011: ‘FoRC: Future of Research Communication’
  –Dagstuhl Workshop
  –Involve key people (include funding bodies, libraries,
    institutions) to see where bottlenecks are
  –Write white paper, implement...
F. Social Change. Some next Steps:
• Fall 2010: ‘Beyond the PDF’:
  –Workshop organized by Phil Bourne @UCSD:
     • Take one paper from his group
     • And all data that went into making that paper
     • Including all correspondence, raw data, etc.
  –Challenge: how better to represent that?
• 2010 - 2011: Try to gather resources, current efforts, etc. on
  virtual platform
• August 2011: ‘FoRC: Future of Research Communication’
  –Dagstuhl Workshop
  –Involve key people (include funding bodies, libraries,
    institutions) to see where bottlenecks are
  –Write white paper, implement...
• Throughout: Start using these tools and writing this way!
Interest to collaborate on any of these topics?




             a.dewaard@elsevier.com
Interest to collaborate on any of these topics?

A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly




              a.dewaard@elsevier.com
Interest to collaborate on any of these topics?

A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
 and provenance-tracked elements




              a.dewaard@elsevier.com
Interest to collaborate on any of these topics?

A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
 and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
 information on any knowledge item created in a lab,
 including provenance/privacy/IPR rights




              a.dewaard@elsevier.com
Interest to collaborate on any of these topics?

A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
 and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
 information on any knowledge item created in a lab,
 including provenance/privacy/IPR rights
D. Semantic/Linked Data XML repositories.




              a.dewaard@elsevier.com
Interest to collaborate on any of these topics?

A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
 and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
 information on any knowledge item created in a lab,
 including provenance/privacy/IPR rights
D. Semantic/Linked Data XML repositories.
E. Publishing systems as application servers



              a.dewaard@elsevier.com
Interest to collaborate on any of these topics?

A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights
D. Semantic/Linked Data XML repositories.
E. Publishing systems as application servers
F. Social change: Scientists store, track and annotate their
  work.

               a.dewaard@elsevier.com

More Related Content

More from Anita de Waard

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseAnita de Waard
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?Anita de Waard
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataAnita de Waard
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsAnita de Waard
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesAnita de Waard
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Anita de Waard
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?Anita de Waard
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data ManagementAnita de Waard
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseAnita de Waard
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of PublishingAnita de Waard
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsAnita de Waard
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryAnita de Waard
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data SharingAnita de Waard
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingAnita de Waard
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumAnita de Waard
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataAnita de Waard
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016Anita de Waard
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...Anita de Waard
 

More from Anita de Waard (20)

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data Commons
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring Guidelines
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
 
History of the future
History of the futureHistory of the future
History of the future
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with Dataverse
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost Recovery
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data Sharing
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly Publishing
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
 

Recently uploaded

Diuretic, Hypoglycemic and Limit test of Heavy metals and Arsenic.-1.pdf
Diuretic, Hypoglycemic and Limit test of Heavy metals and Arsenic.-1.pdfDiuretic, Hypoglycemic and Limit test of Heavy metals and Arsenic.-1.pdf
Diuretic, Hypoglycemic and Limit test of Heavy metals and Arsenic.-1.pdfKartik Tiwari
 
Ernest Hemingway's For Whom the Bell Tolls
Ernest Hemingway's For Whom the Bell TollsErnest Hemingway's For Whom the Bell Tolls
Ernest Hemingway's For Whom the Bell TollsPallavi Parmar
 
8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital Management8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital ManagementMBA Assignment Experts
 
How to Send Pro Forma Invoice to Your Customers in Odoo 17
How to Send Pro Forma Invoice to Your Customers in Odoo 17How to Send Pro Forma Invoice to Your Customers in Odoo 17
How to Send Pro Forma Invoice to Your Customers in Odoo 17Celine George
 
What is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptxWhat is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptxCeline George
 
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...Gary Wood
 
Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111GangaMaiya1
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxannathomasp01
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxDr. Ravikiran H M Gowda
 
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfUGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfNirmal Dwivedi
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...Nguyen Thanh Tu Collection
 
Play hard learn harder: The Serious Business of Play
Play hard learn harder:  The Serious Business of PlayPlay hard learn harder:  The Serious Business of Play
Play hard learn harder: The Serious Business of PlayPooky Knightsmith
 
AIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptAIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptNishitharanjan Rout
 
SURVEY I created for uni project research
SURVEY I created for uni project researchSURVEY I created for uni project research
SURVEY I created for uni project researchCaitlinCummins3
 
SPLICE Working Group: Reusable Code Examples
SPLICE Working Group:Reusable Code ExamplesSPLICE Working Group:Reusable Code Examples
SPLICE Working Group: Reusable Code ExamplesPeter Brusilovsky
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
Observing-Correct-Grammar-in-Making-Definitions.pptx
Observing-Correct-Grammar-in-Making-Definitions.pptxObserving-Correct-Grammar-in-Making-Definitions.pptx
Observing-Correct-Grammar-in-Making-Definitions.pptxAdelaideRefugio
 

Recently uploaded (20)

Diuretic, Hypoglycemic and Limit test of Heavy metals and Arsenic.-1.pdf
Diuretic, Hypoglycemic and Limit test of Heavy metals and Arsenic.-1.pdfDiuretic, Hypoglycemic and Limit test of Heavy metals and Arsenic.-1.pdf
Diuretic, Hypoglycemic and Limit test of Heavy metals and Arsenic.-1.pdf
 
Ernest Hemingway's For Whom the Bell Tolls
Ernest Hemingway's For Whom the Bell TollsErnest Hemingway's For Whom the Bell Tolls
Ernest Hemingway's For Whom the Bell Tolls
 
8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital Management8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital Management
 
How to Send Pro Forma Invoice to Your Customers in Odoo 17
How to Send Pro Forma Invoice to Your Customers in Odoo 17How to Send Pro Forma Invoice to Your Customers in Odoo 17
How to Send Pro Forma Invoice to Your Customers in Odoo 17
 
What is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptxWhat is 3 Way Matching Process in Odoo 17.pptx
What is 3 Way Matching Process in Odoo 17.pptx
 
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
 
Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdfUGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
UGC NET Paper 1 Unit 7 DATA INTERPRETATION.pdf
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Including Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdfIncluding Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdf
 
Play hard learn harder: The Serious Business of Play
Play hard learn harder:  The Serious Business of PlayPlay hard learn harder:  The Serious Business of Play
Play hard learn harder: The Serious Business of Play
 
AIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptAIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.ppt
 
SURVEY I created for uni project research
SURVEY I created for uni project researchSURVEY I created for uni project research
SURVEY I created for uni project research
 
SPLICE Working Group: Reusable Code Examples
SPLICE Working Group:Reusable Code ExamplesSPLICE Working Group:Reusable Code Examples
SPLICE Working Group: Reusable Code Examples
 
VAMOS CUIDAR DO NOSSO PLANETA! .
VAMOS CUIDAR DO NOSSO PLANETA!                    .VAMOS CUIDAR DO NOSSO PLANETA!                    .
VAMOS CUIDAR DO NOSSO PLANETA! .
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Observing-Correct-Grammar-in-Making-Definitions.pptx
Observing-Correct-Grammar-in-Making-Definitions.pptxObserving-Correct-Grammar-in-Making-Definitions.pptx
Observing-Correct-Grammar-in-Making-Definitions.pptx
 

Vu210610futurejournal

  • 1. The Future of the Journal Anita de Waard, a.dewaard@elsevier.com Disruptive Technologies Director, Elsevier Labs VU, June 21, 2010
  • 2. Science is made of information...
  • 3. Science is made of information... ...that gets created...
  • 4. Science is made of information... ...that gets created... ... and destroyed.
  • 5. What is the problem?
  • 6. What is the problem? 1. Researchers can’t keep track of their data.
  • 7. What is the problem? 1. Researchers can’t keep track of their data. 2. Data is not stored in a way that is easy for authors.
  • 8. What is the problem? 1. Researchers can’t keep track of their data. 2. Data is not stored in a way that is easy for authors. 3. For readers, article text is not linked to the underlying data.
  • 9. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan
  • 10. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan 1. Research: Each item in the system has metadata metadata (including provenance) and relations to other data items metadata added to it. metadata metadata metadata
  • 11. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan 1. Research: Each item in the system has metadata metadata (including provenance) and relations to other data items metadata added to it. 2. Workflow: All data items created in the lab are added metadata to a (lab-owned) workflow system. metadata metadata
  • 12. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan 1. Research: Each item in the system has metadata metadata (including provenance) and relations to other data items metadata added to it. 2. Workflow: All data items created in the lab are added metadata to a (lab-owned) workflow system. 3. Authoring: A paper is written in an authoring tool which can pull data with provenance from the workflow tool in the appropriate representation into the document. metadata metadata Rats were subjected to two grueling tests (click on fig 2 to see underlying data). These results suggest that the neurological pain pro-
  • 13. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan 1. Research: Each item in the system has metadata metadata (including provenance) and relations to other data items metadata added to it. 2. Workflow: All data items created in the lab are added metadata to a (lab-owned) workflow system. 3. Authoring: A paper is written in an authoring tool which can pull data with provenance from the workflow tool in the appropriate representation into the document. metadata 4. Editing and review: Once the co-authors agree, the paper is ‘exposed’ to the editors, who in turn expose it to metadata reviewers. Reports are stored in the authoring/editing system, the paper gets updated, until it is validated. Rats were subjected to two grueling tests (click on fig 2 to see underlying data). These results suggest that the neurological pain pro- Review Revise Edit
  • 14. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan 1. Research: Each item in the system has metadata metadata (including provenance) and relations to other data items metadata added to it. 2. Workflow: All data items created in the lab are added metadata to a (lab-owned) workflow system. 3. Authoring: A paper is written in an authoring tool which can pull data with provenance from the workflow tool in the appropriate representation into the document. metadata 4. Editing and review: Once the co-authors agree, the paper is ‘exposed’ to the editors, who in turn expose it to metadata reviewers. Reports are stored in the authoring/editing system, the paper gets updated, until it is validated. 5. Publishing and distribution: When a paper is published, a collection of validated information is exposed to the world. It remains connected to its related Rats were subjected to two data item, and its heritage can be traced. grueling tests (click on fig 2 to see underlying data). These results suggest that the neurological pain pro- Review Revise Edit
  • 15. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan 1. Research: Each item in the system has metadata metadata (including provenance) and relations to other data items metadata added to it. 2. Workflow: All data items created in the lab are added metadata to a (lab-owned) workflow system. 3. Authoring: A paper is written in an authoring tool which can pull data with provenance from the workflow tool in the appropriate representation into the document. metadata 4. Editing and review: Once the co-authors agree, the paper is ‘exposed’ to the editors, who in turn expose it to metadata reviewers. Reports are stored in the authoring/editing system, the paper gets updated, until it is validated. 5. Publishing and distribution: When a paper is published, a collection of validated information is exposed to the world. It remains connected to its related Rats were subjected to two data item, and its heritage can be traced. grueling tests (click on fig 2 to see underlying 6. User applications: distributed applications run on this data). These results suggest that ‘exposed data’ universe. the neurological pain pro- Some other publisher Review Revise Edit
  • 16. What is needed to get there?
  • 17. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly
  • 18. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements
  • 19. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights
  • 20. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Semantic/Linked Data XML repositories.
  • 21. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Semantic/Linked Data XML repositories. E. Publishing systems as application servers
  • 22. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Semantic/Linked Data XML repositories. E. Publishing systems as application servers F. Social change: Scientists store, track and annotate their work.
  • 23. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly tool builders B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Semantic/Linked Data XML repositories. E. Publishing systems as application servers F. Social change: Scientists store, track and annotate their work.
  • 24. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly tool builders B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements tool builders C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Semantic/Linked Data XML repositories. E. Publishing systems as application servers F. Social change: Scientists store, track and annotate their work.
  • 25. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly tool builders B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements tool builders C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights standards bodies D. Semantic/Linked Data XML repositories. E. Publishing systems as application servers F. Social change: Scientists store, track and annotate their work.
  • 26. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly tool builders B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements tool builders C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights standards bodies D. Semantic/Linked Data XML repositories. publishers E. Publishing systems as application servers F. Social change: Scientists store, track and annotate their work.
  • 27. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly tool builders B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements tool builders C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights standards bodies D. Semantic/Linked Data XML repositories. publishers E. Publishing systems as application servers publishers F. Social change: Scientists store, track and annotate their work.
  • 28. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly tool builders B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements tool builders C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights standards bodies D. Semantic/Linked Data XML repositories. publishers E. Publishing systems as application servers publishers F. Social change: Scientists store, track and annotate their work. institutes, funding bodies, individuals
  • 29. A. Workflow tools are emerging
  • 30. A. Workflow tools are emerging http://MyExperiment.org
  • 31. A. Workflow tools are emerging http://VisTrails.org http://MyExperiment.org
  • 32. A. Workflow tools are emerging http://VisTrails.org http://MyExperiment.org http://wings.isi.edu/
  • 33. B. Authoring is a part of doing science The Knowledge Ecosystem: Interlocking Cycles of Research Draw conclusions Draw conclusions Communicate Collect data Collect data Perform Perform experiment Gather info experiment Synthesize Create/modify Create/modify hypothesis hypothesis Slide by Tim Clark
  • 34. B. Authoring ‘ecosystems’: e.g., SWAN SWAN Semantic Relationships Excel file describes Private makes hasEvidence annotates comment publication person Claim hasEvidence authoredBy authorOf publication shareWith describes MSWORD file Slide by Tim Clark
  • 35. B. Authoring ‘ecosystems’: e.g., SWAN person SWAN Semantic Relationships annotates comment authoredBy makes hasEvidence concept annotates Claim publication shareWith hypothesis makes hasEvidence gene Claim publication hasEvidence discussedIn group publication Public Excel file describes describes PDFs Private makes hasEvidence annotates comment publication person Claim hasEvidence authoredBy authorOf publication shareWith describes MSWORD file Slide by Tim Clark
  • 36. C. Metadata: HCLS SiG Scientific Discourse
  • 37. C. Metadata: HCLS SiG Scientific Discourse http://esw.w3.org/HCLSIG/SWANSIOC:
  • 38. C. Metadata: HCLS SiG Scientific Discourse http://esw.w3.org/HCLSIG/SWANSIOC: Project Description
  • 39. C. Metadata: HCLS SiG Scientific Discourse http://esw.w3.org/HCLSIG/SWANSIOC: Project Description Provide a Semantic Web platform for biomedical discourse which can be evolved over time into a more general facility for many types of scientific discourse, and which is linked to key biological categories specified by ontologies.
  • 40. C. Metadata: HCLS SiG Scientific Discourse http://esw.w3.org/HCLSIG/SWANSIOC: Project Description Provide a Semantic Web platform for biomedical discourse which can be evolved over time into a more general facility for many types of scientific discourse, and which is linked to key biological categories specified by ontologies. Discourse categories should include research questions, scientific assertions or claims, hypotheses, comments and discussion, experiments, data, publications, citations, and evidence.
  • 41. C. Metadata: HCLS SiG Scientific Discourse http://esw.w3.org/HCLSIG/SWANSIOC: Project Description Provide a Semantic Web platform for biomedical discourse which can be evolved over time into a more general facility for many types of scientific discourse, and which is linked to key biological categories specified by ontologies. Discourse categories should include research questions, scientific assertions or claims, hypotheses, comments and discussion, experiments, data, publications, citations, and evidence. Our primary scientific use cases will be derived from problems in digital scientific communications and web-based research collaboratories supporting research in neurological disorders and therapies.
  • 42. C. Metadata: Annotation Ontology foaf:person rdf:Type http://www.ht.org/ foaf.rdf#me June 1, 2010 pav:createdBy pav:createdOn ann:annotates http://anyurl.com/sf_pat01.html hasTag rdf:Type hasTopic Tag Atomic tag FMA:skull ann:context onDocument Linear skull fracture rdf:Type Other annotations on the same document: 1. Atomic annotation on image (tag: “hematoma”) 2. General annotation (tag: “injury”) InitEndCornerSelector init Other annotations on similar documents: (304, 507) 1. General annotation (tag: “skull fracture”) rdfs:SubClassOf end (380, 618) ImageSelector Slide by Tim Clark
  • 43. C. Metadata: Rhetorical Document Task Image by Tudor Groza
  • 44. C. Metadata: Rhetorical Document Task Image by Tudor Groza
  • 45. C. Metadata: Rhetorical Document Task Image by Tudor Groza Call today: discuss modeling coarse-grained rhetorical structure as PAM (PRISM Aggregator Message) - a standard format for transferring XML from publishers to aggregators (used by Nature.com, and Elsevier in the future)
  • 46. D. Linked Data: E.g. for Elsevier
  • 47. D. Linked Data: E.g. for Elsevier <ce:section id=#123>
  • 48. D. Linked Data: E.g. for Elsevier this says <ce:section id=#123> mice like cheese
  • 49. D. Linked Data: E.g. for Elsevier said @anita on May 31 2010 this says <ce:section id=#123> mice like cheese
  • 50. D. Linked Data: E.g. for Elsevier but we all know she was jetlagged then said @anita on May 31 2010 this says <ce:section id=#123> mice like cheese
  • 51. D. Linked Data: E.g. for Elsevier immutable, $$, proprietary but we all know she was jetlagged then said @anita on May 31 2010 this says <ce:section id=#123> mice like cheese
  • 52. D. Linked Data: E.g. for Elsevier immutable, $$, proprietary dynamic, personal, task-driven, - open? but we all know she was jetlagged then said @anita on May 31 2010 this says <ce:section id=#123> mice like cheese
  • 53. D. What to link? Semantic annotation grid
  • 54. D. What to link? Semantic annotation grid
  • 55. D. What to link? Semantic annotation grid Granularity collection document claim triple entity
  • 56. D. What to link? Semantic annotation grid Granularity collection document claim triple entity Moment measure author/editor typesetter/production reader/data minin
  • 57. D. What to link? Semantic annotation grid Granularity collection document claim triple entity Moment measure author/editor typesetter/production reader/data minin Meansmanual semi-automated automated
  • 58. D. What to link? Semantic annotation grid Granularity collection document claim Automated Copy Editing triple entity Moment measure author/editor typesetter/production reader/data minin Meansmanual semi-automated automated
  • 59. D. What to link? Semantic annotation grid Granularity collection document claim Automated Copy Editing triple entity Moment measure author/editor typesetter/production reader/data minin Reflect Meansmanual semi-automated automated
  • 60. D. A start: .XMP RDF in Elsevier’s PDFs (DC + PRISM)
  • 61. E. Publishing on an Application server
  • 62. E. SD as application server: an example
  • 63. F. Social Change. Some next Steps:
  • 64. F. Social Change. Some next Steps: • Fall 2010: ‘Beyond the PDF’:
  • 65. F. Social Change. Some next Steps: • Fall 2010: ‘Beyond the PDF’: –Workshop organized by Phil Bourne @UCSD:
  • 66. F. Social Change. Some next Steps: • Fall 2010: ‘Beyond the PDF’: –Workshop organized by Phil Bourne @UCSD: • Take one paper from his group
  • 67. F. Social Change. Some next Steps: • Fall 2010: ‘Beyond the PDF’: –Workshop organized by Phil Bourne @UCSD: • Take one paper from his group • And all data that went into making that paper
  • 68. F. Social Change. Some next Steps: • Fall 2010: ‘Beyond the PDF’: –Workshop organized by Phil Bourne @UCSD: • Take one paper from his group • And all data that went into making that paper • Including all correspondence, raw data, etc.
  • 69. F. Social Change. Some next Steps: • Fall 2010: ‘Beyond the PDF’: –Workshop organized by Phil Bourne @UCSD: • Take one paper from his group • And all data that went into making that paper • Including all correspondence, raw data, etc. –Challenge: how better to represent that?
  • 70. F. Social Change. Some next Steps: • Fall 2010: ‘Beyond the PDF’: –Workshop organized by Phil Bourne @UCSD: • Take one paper from his group • And all data that went into making that paper • Including all correspondence, raw data, etc. –Challenge: how better to represent that? • 2010 - 2011: Try to gather resources, current efforts, etc. on virtual platform
  • 71. F. Social Change. Some next Steps: • Fall 2010: ‘Beyond the PDF’: –Workshop organized by Phil Bourne @UCSD: • Take one paper from his group • And all data that went into making that paper • Including all correspondence, raw data, etc. –Challenge: how better to represent that? • 2010 - 2011: Try to gather resources, current efforts, etc. on virtual platform • August 2011: ‘FoRC: Future of Research Communication’
  • 72. F. Social Change. Some next Steps: • Fall 2010: ‘Beyond the PDF’: –Workshop organized by Phil Bourne @UCSD: • Take one paper from his group • And all data that went into making that paper • Including all correspondence, raw data, etc. –Challenge: how better to represent that? • 2010 - 2011: Try to gather resources, current efforts, etc. on virtual platform • August 2011: ‘FoRC: Future of Research Communication’ –Dagstuhl Workshop
  • 73. F. Social Change. Some next Steps: • Fall 2010: ‘Beyond the PDF’: –Workshop organized by Phil Bourne @UCSD: • Take one paper from his group • And all data that went into making that paper • Including all correspondence, raw data, etc. –Challenge: how better to represent that? • 2010 - 2011: Try to gather resources, current efforts, etc. on virtual platform • August 2011: ‘FoRC: Future of Research Communication’ –Dagstuhl Workshop –Involve key people (include funding bodies, libraries, institutions) to see where bottlenecks are
  • 74. F. Social Change. Some next Steps: • Fall 2010: ‘Beyond the PDF’: –Workshop organized by Phil Bourne @UCSD: • Take one paper from his group • And all data that went into making that paper • Including all correspondence, raw data, etc. –Challenge: how better to represent that? • 2010 - 2011: Try to gather resources, current efforts, etc. on virtual platform • August 2011: ‘FoRC: Future of Research Communication’ –Dagstuhl Workshop –Involve key people (include funding bodies, libraries, institutions) to see where bottlenecks are –Write white paper, implement...
  • 75. F. Social Change. Some next Steps: • Fall 2010: ‘Beyond the PDF’: –Workshop organized by Phil Bourne @UCSD: • Take one paper from his group • And all data that went into making that paper • Including all correspondence, raw data, etc. –Challenge: how better to represent that? • 2010 - 2011: Try to gather resources, current efforts, etc. on virtual platform • August 2011: ‘FoRC: Future of Research Communication’ –Dagstuhl Workshop –Involve key people (include funding bodies, libraries, institutions) to see where bottlenecks are –Write white paper, implement... • Throughout: Start using these tools and writing this way!
  • 76. Interest to collaborate on any of these topics? a.dewaard@elsevier.com
  • 77. Interest to collaborate on any of these topics? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly a.dewaard@elsevier.com
  • 78. Interest to collaborate on any of these topics? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements a.dewaard@elsevier.com
  • 79. Interest to collaborate on any of these topics? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights a.dewaard@elsevier.com
  • 80. Interest to collaborate on any of these topics? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Semantic/Linked Data XML repositories. a.dewaard@elsevier.com
  • 81. Interest to collaborate on any of these topics? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Semantic/Linked Data XML repositories. E. Publishing systems as application servers a.dewaard@elsevier.com
  • 82. Interest to collaborate on any of these topics? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Semantic/Linked Data XML repositories. E. Publishing systems as application servers F. Social change: Scientists store, track and annotate their work. a.dewaard@elsevier.com